Down-line transcription system using automatic tracking and revenue collection

ABSTRACT

A transcription network having linked computer terminals for a court reporter and for examining, defending, and associate attorneys is disclosed. Real-time transcription text is communicated from the reporter&#39;s terminal through the network. Secure message interchange between the examining attorneys and between defending attorneys is provided along independent communication links and using encryption. Off-site terminals for clients, experts, or other associate attorneys may be linked to the network. A synchronization scheme provides for automatic play-back of audio and video recordings corresponding to the transcribed text. One embodiment of the system provides for billing, monitoring, collection and authorization of transcription events.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of application Ser. No. 09/057,650 filed Apr. 9, 1998, now U.S. Pat. No. 5,970,141 issued on Oct. 19, 1999, which is a continuation of application Ser. No. 08/339,771 filed Nov. 15, 1994, now U.S. Pat. No. 5,740,245 issued Apr. 14, 1998, which is a continuation of application Ser. No. 08/036,488 filed Mar. 24, 1993, now U.S. Pat. No. 5,369,704 issued Nov. 29, 1994.

This application is also a continuation-in-part of application Ser. No. 09/185,114 filed Nov. 3, 1998, now U.S. Pat. No. 6,023,675 issued on Feb. 8, 2000, which is a divisional of U.S. application Ser. No. 08/818,402 filed Mar. 12, 1997, now U.S. Pat. No. 5,949,952 issued Sep. 7, 1999, which is a continuation of U.S. application Ser. No. 08/066,948 filed May 24, 1993, now U.S. Pat. No. 5,878,186 issued Mar. 2, 1999, which is a continuation-in-part of U.S. application Ser. No. 08/036,488 filed Mar. 24, 1993, now U.S. Pat. No. 5,369,704 issued Nov. 29, 1994.

This application also claims priority under 35 U.S.C. 120 to provisional application Ser. No. 60/140,760 filed Jun. 24, 1999.

INCORPORATION BY REFERENCE

Applicant hereby incorporates by reference U.S. Pat. No. 5,369,704, U.S. Pat. No. 5,878,186, and provisional application Ser. No. 60/140,760 herein in their entirety.

BACKGROUND OF THE INVENTION

This invention relates to a transcription system used by court reporters; and, more particularly, it relates to a method and apparatus incorporating an automatic transcription system for providing real-time use and manipulation of transcribed testimony by attorneys, judges, court reporters, witnesses and clients.

As is well known, legal proceedings such as a deposition or trial involve the participation of, among others, an examining attorney who asks questions and a witness who must answer (“testify”) while under oath. These answers (“testimony”) are recorded by the court reporter, along with the associated questions and related conversation, using a stenographic recorder. A stenographic recorder is a machine which provides a set of keys which are stroked by the court reporter in various combinations and sequences to represent a spoken word. To provide a backup to the key-strokes, court reporters use a tape recorder to record the entire proceeding.

Older versions of the stenographic recorder only record the court reporter's key-strokes much the way a computer printer does, creating a paper tape containing the printed key-strokes as a record of the legal proceeding. To the untrained eye, the printed key-strokes are incomprehensible. After the legal proceeding ends, the court reporter reads the paper tape and manually transcribes each printed key-stroke back into the words that were spoken, creating a readable transcript of the testimony. This manual process is herein called “manual, post-processed transcription”.

Newer versions of stenographic recorders have been developed and linked to computer aided transcription (“CAT”) systems to help automate the manual transcription process. Instead of solely using paper tape recording methods, the stenographic recorder also electronically stores key-strokes in built-in memory or on disk. After using such a newer recorder, the court reporter returns to his office and transfers the electronically stored key-strokes to his CAT system for transcription. Although the bulk of electronically stored key-strokes may be translated automatically, the court reporter must still work interactively with the CAT system to translate those key-strokes which the CAT system could not recognize. This nearly automated transcription process is herein called “automatic, post-processed transcription”.

The most recent versions of the stenographic recorder also accommodate “automatic, parallel transcription”, by electronically sending each electronically registered key-stroke via a telephone line to a remote CAT system during the legal proceeding. This system requires two court reporters working in parallel: one using the stenographic recorder to record the testimony; and the other, who is familiar with the first reporter's key-stroking style, interacting with the CAT system to carry out the transcription of each key-stroke received. Working in parallel, the court reporters might produce a readable transcript within a few hours after the proceeding has ended.

For all of the transcription systems mentioned above, court reporters often use the tape recorder to aid in translating mis-heard words and untranslated key-strokes. To locate the corresponding audio on a recorded tape often wastes a great deal of the court reporter's time.

Automatic, parallel transcription systems have also been unsuccessfully used in a configuration to assist a deaf witness in legal proceedings. Taking the place of the second, parallel court reporter in the aforementioned normal scenario, the deaf witness is placed in front of the CAT system. The CAT system translates and displays the bulk of the key-strokes and sequences in real-time so that the deaf person can read the questions asked. One major problem found in this configuration is that the untranslated key-strokes are displayed in an undecipherable form to all but a court reporter. A relatively large number of the questions asked become unintelligible. Furthermore, the fact that the witness is deaf complicates this matter to a point of failure. For every unintelligible question encountered, the court reporter must stop using the stenographic recorder, read the computer screen and then write down the translation by hand on a piece of paper for the witness. As an additional problem the witness's own words annoyingly echo back to the computer screen within a short delay after they are spoken, making it very difficult for the witness to testify without distraction.

The development of the automatic, post-processed and parallel transcription system was driven solely by the court reporter's need to accelerate the transcription process. Using these systems, the court reporter operates more efficiently, handles a larger volume of work and, as a result, reaps greater profit. Using any of these transcription systems, however, a readable transcript is never received by an attorney, judge or witness until some time after the immediate legal proceeding has ended.

There are a variety of problems facing attorneys, judges and witnesses relating to the development of testimony taken in a legal proceeding. The focal point of most of these problems involves the lack of a real-time, readable record of a witness's testimony while that witness is undergoing examination. Reliance must be based upon either the minimal assistance the court reporter might offer or notes taken which often prove to be burdensome, confusing and untrustworthy. Further details of these problems become apparent when evaluating the type of legal proceeding involved.

A deposition proceeding often involves the participation of a witness, court reporter, examining attorney, associate examining attorney, defending attorney and associate defending attorney. Generally, the examining attorney asks questions to which the witness responds with answers. The associate examining attorney assists the examining attorney in conducting this inquiry. The defending attorney, with assistance from the associate defending attorney, evaluates the questions being asked and raises objections when legally improper questions are detected. If a legally improper question occurs but is not detected, the corresponding answer may be used to the detriment of the defending attorney's case. During the entire proceeding, the court reporter records all of the questions, answers, objections and discussions held during the deposition.

In parallel with the court reporter's recording of the proceeding, all attorneys rapidly take notes in which they attempt to summarize what they believe to be the important factors of what they believe they heard the witness say. Although these notes provide a necessary, working record to aid in conducting the deposition, they are often incomplete, mischaracterize or, even worse, completely misconstrue a witness's answers. Relying on such notes often proves to be detrimental.

If an attorney does not understand an answer, he often asks the court reporter to read the answer back from the paper tape record. The examining or defending attorney may also ask for the last answer to be read back if the answer was so lengthy that notes were not taken fast enough to keep up with the witness. Similarly, either attorney may ask the court reporter to read back the last question for a confused witness. When so instructed, the court reporter stops recording, picks up the paper tape output from the stenographic recorder, searches for the portion of the record at issue, and reads the stenographic key-strokes. This reading is often broken with misplaced delays in pronunciation between phonemes and words making it difficult to understand. Furthermore, because searching for a portion of the record in a series of printed key-strokes proves to be very difficult and time consuming, attorneys are forced to limit requests to have only the most recent Q & A's read back.

More specifically, the examining attorney takes notes in order to help formulate questions during a deposition. These notes are referenced particularly as the witness reveals additional evidence about the same subject. If the deposition runs for several days, the examining attorney often reviews each day's notes during the evening in his hotel room in preparation for the next day of deposition. Reviewing cryptic notes taken many hours earlier often causes further misunderstandings which may not be detected until a readable transcript is received. Operating from unreliable notes, the examining attorney might incorrectly: 1) proceed with irrelevant lines of questioning; 2) reopen resolved, important issues—tipping-off the defending attorneys; or 3) abandon unresolved, important lines of questioning.

A major goal of the examining attorney is to establish the proper form of Q & A's which relate to important issues in a case. In furthering this goal, the examining attorney often attempts to embody one or more of the witness's prior answers as part of a new, summarizing question. Because this new question is often based upon unreliable notes or poor memory, the new question mischaracterizes the prior answers. In response, the defending attorney may correctly object if he detects the mischaracterization from his potentially unreliable notes and poor memory. The objection forces the examining attorney to consider either asking the court reporter to try to find and read back the actual Q & A's at issue or re-asking the entire line of questioning again. More importantly, this objection warns the witness to carefully scrutinize each question relating to this subject before answering. As a result, a battle usually results over superfluous terms and phrases, and obtaining proper form is prevented. Furthermore, even if the examining attorney believes proper form has been achieved, he is often reluctant to verify his belief by having the Q & A read back because it serves to alert the defending attorneys that important information may have come to light. Thus, improper form on critical issues often results.

Leeching away his time for, and therefore the reliability of, his note taking, the examining attorney is also faced with a multitude of other tasks such as: 1) analyzing the response of the witness; 2) formulating a follow up question in view of the witnesses response; 3) analyzing the defending attorneys objections to the pending question; 4) evaluating the demeanor of the witness upon each Q & A; 5) reviewing an outline for the next preplanned question or line of questioning; and 6) writing further notes regarding future lines of questioning to be asked or reviewed that evening at the hotel.

To relieve him of some of the burden, the examining attorney often brings an associate examining attorney with him. The associate does several things including: 1) taking notes to fill gaps in the examining attorneys notes at a break or later that evening at the hotel when the two attorneys plan the next day's questions; 2) ordering and finding documentary exhibits for the examining attorney; 3) writing messages on small pieces of paper (usually POST-IT brand notes) and passing them to the examining attorney; 4) monitoring the passed messages so as to withdraw, supplement or replace them when necessary; and 5) keeping track of the documentary exhibits entered into evidence.

While relieving some of the burden upon the examining attorney, the associate examining attorney creates additional burdens. For example, in suggesting a follow-up question, the associate must rapidly compose a brief message and immediately obtain the attention of the examining attorney before the examining attorney changes the line of questioning. As a result, the examining attorney sometimes forgets his own line of questioning. Moreover, these messages tend to be illegible and too brief for the examining attorney to understand. Without supplementation, these messages serve no purpose but to distract.

To suggest new lines of questioning, the associate may casually draft a rather complete message intending for the examining attorney to read the message at the examining attorney's leisure. However, the mere act of passing the note tends to distract the attention of the examining attorney.

The associate attorney may also pass messages to help counter the defending attorneys' objections or pointing out the witnesses demeanor in response to a question when the examining attorney is looking elsewhere. Again, this distracts the examining attorney.

Upon receiving a message, the examining attorney must decide when to read it, and, once read, must decide whether and when to use it. If prior to reading the message, the examining attorney asks the question contained therein, the message becomes no more than a distraction. If the examining attorney determines that the message contains a good question, the examining attorney either asks that question immediately and discards the note or saves the note for a later time. At times, an attorney may have ten to fifteen notes in front of him that he must deal with. This is particularly the case where the associate knows more facts about the witness than does the examining attorney, or where the associate attorney is the more senior or more experienced of the two.

As previously stated, the defending attorney must also take summarizing notes regarding both the Q & A's in the deposition. From these notes, the defending attorney attempts to deduce upcoming lines of questioning, specific details of the examining attorney's positions, strengths or weaknesses associated with those positions and objections such as “asked and answered” or “mischaracterizing the testimony”. These notes also remind the defending attorney to discuss incorrect statements made by the witness during a break so that the record can be corrected when the deposition resumes.

Among a variety of other duties also detracting from his note taking ability, the defending attorney must object whenever the question is inappropriate and state the reasons for his objections. Not only do these objections address evidentiary concerns, but some attorneys use the objection to warn the witness to scrutinize critical questions posed by the examining attorney as previously described. Furthermore, all such objections must be seasonable, i.e., before the witness answers a pending objectionable question. Note taking tends to detract from the defending attorney's ability to seasonably object and vice versa.

To help alleviate this problem, the defending attorney often brings along an associate defending attorney. Much like the associate examining attorney, the associate defending attorney takes notes and passes messages to the defending attorney. If the note involves making an objection, the defending attorney will rarely be able to make it seasonably. The messages just cannot be drafted, passed and read fast enough. The notes taken do however help fill the gap found in the defending attorney's notes who can then focus on his other duties. In addition, the description of many of the other problems encountered by the examining attorneys are equally applicable here.

During a trial proceeding, both the plaintiff's attorneys and defendant's attorneys take turns assuming the role and duties of the examining and defending attorneys as previously described in the deposition proceedings. In many cases, instead of a single associate attorney, many associate attorneys may participate. The examining attorney at trial is called the “first chair”. The most senior associate attorney, if at least one, is called the “second chair” and so on. There may be several “chairs” representing a single client.

The problems described previously relating to a deposition directly, and perhaps more severely, apply to a trial proceeding. This is due in part to the increased number of attorneys involved. Additionally, several different problems associated specifically with the trial proceeding also exist. For example, because the first chair is often on his feet and not near the other chairs on his team, passing messages or note taking proves to be very difficult if not impossible. This communication gap manifests other problems when the first chair attempts to obtain trial exhibits or other supporting materials. Scrambling to draft and pass a message to, or to reach whispering range of, a lower number chair often occurs. Other chairs that could possibly know of the location of the requested material are often oblivious to the request.

Another communication gap exists between the attorneys at trial and those that cannot attend. Only those attorneys present and listening can respond at the proceeding. Non-attendees who might otherwise be able to assist have no means to evaluate the trial or communicate with the attending attorneys.

On the defending attorney's side, the nomenclature using the first chair, second chair and so on is also used. The problems applicable to the defending attorneys during a deposition are also applicable and compounded at trial because of the enhanced significance of the defending attorney's duties beyond those of detailed note taking. As a result, much of the note taking must be left to other chairs.

A judge must monitor, evaluate, and moderate the procedures and substance of the trial proceedings. Each question or line of questioning, answer and exhibit must be carefully analyzed for content, evidentiary form and relevance so that rulings on related objections can be immediately made. If a judge is distracted, he is faced with problems previously enumerated with having to have the court reporter read back from the record. The judge also takes notes relating to various Q & A's or statements made during trial, particularly when ruling on issues of law or fact from the bench. Due to the multitude of duties facing the judge, the judge's note taking is also subject to many of the other previously mentioned problems.

Court reporters charge revenue on a page by page, word by word or hourly basis, typically, when real time transcriptions systems are employed. The software and often the hardware underlying the CAT system are typically sold to the court reporter. The amount of revenue generated by the CAT system in exchange for the real time transcription functionality provided to attorneys and their clients typically far exceeds the cost of the underlying software and hardware. Considering the limited number of copies sold in view of the limited size of the market, revenue collection via sales of CAT systems has proved difficult to capture by any party other than the court reporters.

Currently, facing the foregoing problems are over thirty thousand court reporters and hundreds of thousands of attorneys and judges in the United States alone. Hence, it would be highly desirable to solve the foregoing variety of problems enumerated above facing attorneys, judges, court reporters and witnesses in conducting legal proceedings such as a deposition or trial by using and manipulating testimony generated in real-time by an automatic transcription system.

It is therefore an object of the present invention to provide a method and apparatus which will aid the examining attorney's use and manipulation of testimony generated in real-time by an automatic transcription system.

It is another object of the present invention to provide a method and apparatus which aids associate examining attorneys in the use and manipulation of testimony generated in real-time by an automatic transcription system to assist the examining attorney.

It is yet another object of the present invention to provide a method and apparatus which aids the defending attorney in the use and manipulation of testimony generated in real-time by an automatic transcription system.

It is another object of the present invention to provide a method and apparatus which will aid the associate defending attorney's use and manipulation of testimony generated in real-time by an automatic transcription system in assisting the defending attorney.

It is therefore an object of the present invention to provide a method and apparatus which will aid the examining first chair's use and manipulation of testimony generated in real-time by an automatic transcription system.

It is another object of the present invention to provide a method and apparatus which will aid the examining second chair's use and manipulation of testimony generated in real-time by an automatic transcription system in assisting the examining first chair.

It is another object of the present invention to provide a method and apparatus which will aid all examining higher chairs, use and manipulation of testimony generated in real-time by an automatic transcription system in assisting the examining second chair.

It is yet another object of the present invention to provide a method and apparatus which will aid the defending first chair's use and manipulation of testimony generated in real-time by an automatic transcription system.

It is another object of the present invention to provide a method and apparatus which will aid the defending second chairs' use and manipulation of testimony generated in real-time by an automatic transcription system in assisting the defending first chair.

It is another object of the present invention to provide a method and apparatus which will aid all defending higher chairs' use and manipulation of testimony generated in real-time by an automatic transcription system in assisting the defending second chair.

It is a further object of the present invention to provide a method and apparatus which will aid the judge's use and manipulation of testimony generated in real-time by an automatic transcription system.

SUMMARY OF THE INVENTION

These and other objects of the present invention are achieved in a transcription network having a reporter's terminal, an attorney's terminal, and an associate's terminal with a communication link therebetween. Specifically, the attorney's terminal and associate's terminal receive text signals representative of spoken words from the reporter's terminal via the communication link, and display the text. The associate's terminal also provides a means for generating and sending messages via the communication link to the attorney's terminal.

Other objects are also achieved in a transcription system having a terminal, for transcribing signals representative of spoken words into corresponding alphabetic and numeric text, which is linked to a recorder for recording spoken words. A means which operates across the link creates associations between recorded spoken words and corresponding alphabetic and numeric text.

Objects are also achieved in a reporting system having a transcriber node which operates on coded representations of spoken words and having cross-reference and phoneme libraries. Specifically, the reporting system uses the cross-reference library to identify associations between coded representations of spoken words and corresponding alphabetic and numeric text. Where associations cannot be identified, the reporting system then uses a phoneme library for associating spoken words with pronounceable text.

Objects are also achieved in a reporting system operable on coded representations of spoken words. Particularly, the reporting system comprises a transcriber which cross-references the coded representations of spoken words so as to identify corresponding alphabetic and numeric text. The identified text is then analyzed and, if appropriate, objections to the spoken words are provided.

Objects are also achieved in a transcription network comprising a plurality of terminals each having a display screen which is electrically controllable for visually displaying alphabetic and numeric text. A hierarchical communication link controls the communication flow between the plurality terminals. Each of the terminals send and receive messages via said hierarchical communication link, which provides for selective relaying of messages between said plurality of terminals.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a perspective view of an embodiment of an overall system configuration according to the present invention.

FIG. 2 is a schematic block diagram of an embodiment of the stenographic recorder used in the present invention.

FIG. 3 is a detailed view of the interconnection of the stenographic recorder and CAT system.

FIG. 4 a is a diagram representing the association of data fields into a record which is used as a basis for the overall data structure for transcription according to the present invention.

FIG. 4 b is a detailed diagram representing the overall data structure used by the CAT system in transcription.

FIG. 5 a is a diagram representing the phoneme library used in the present invention wherein substitutions of readable phoneme text is found for untranslated key-strokes.

FIG. 5 b is an exemplary diagram of a foreign language table which is used to cross-reference text of a first language that corresponds to translated words of a first language.

FIG. 6 a is a perspective view of the examining attorney terminal and its interconnection to the CAT system and stenographic recorder.

FIG. 6 b is a perspective view of the examining attorney terminal's which provides further detail for both the display and keyboard.

FIG. 7 a is a perspective view of the examining attorney terminal displaying the outline mode of the present invention which provides text editing of a case law outline indexing a set of preset messages.

FIG. 7 b is a perspective view of the examining attorney terminal displaying a tree-structured outline providing hierarchical display of the categories of law to be investigated during a deposition.

FIG. 8 is a perspective view of the associate examining attorney terminal which provides details in both the display and keyboard and its interconnection to the examining attorney terminal, CAT system and stenographic recorder.

FIG. 9 is a perspective view illustrating the use of audio or video tracking in the present invention.

FIG. 10 is a perspective view of the wireless and hardwired interconnection of an alternate embodiment of the transcription system in a court room setting.

FIG. 11 is a perspective view of another embodiment of the present invention which cooperatively utilizes stenographic and voice transcription.

FIG. 12 is a diagram illustrating two alternate variations of the present invention where the court reporter terminal is not linked to the transcription network.

FIG. 13 is a perspective view of another embodiment of the present invention where, instead of using a keyboard, the attorney terminals 15 and 17 utilize pen-based or touch-sensitive screen input.

FIG. 14 illustrates a flow-chart describing the communication protocol used by the attorney terminals for communicating with the reporter terminal (or CAT system) 21.

FIG. 15 illustrates an alternate flow-chart describing the reporter terminal's functionality in carrying out the Q&A communication protocol.

FIG. 16 is a perspective view of an alternate embodiment of the present invention not only providing for review and analysis of synchronized audio, video and textual transcripts, but also providing audio, video and text on demand via a remote relaying and stage system.

FIG. 17 is a perspective view of an overall transcription system according to the present invention providing billing monitoring, collection and transcription authorization.

FIG. 18 is a block diagram of one embodiment of the network of FIG. 16.

DESCRIPTION OF THE PREFERRED EMBODIMENT

FIG. 1 is a perspective view of an embodiment of an overall transcription system configuration according to the present invention. A stenographic recorder 11 is used by a court reporter at a deposition, hearing or other transcription matter, to record digital coded signals representative of verbal communications as they occur in real-time. Using a communication link 19, the recorder 11 transfers the representative signals to a computer aided transcription (“CAT”) system 13, a computer terminal used by the court reporter, for transcription into alphabetic and numeric text corresponding to the actual words spoken.

The CAT system 13 communicates the alphabetic and numeric text it generates along two independent communication links 20 and 21 to an examining attorney's terminal 15 and a defending attorney's terminal 16, respectively. The examining attorney's terminal 15 relays the text along a communication link 23 to an associate examining attorney's terminal 17. Likewise, the defending attorney's terminal 16 relays the communicated alphabetic and numeric text representing the spoken words along a communication link 22 to an associate examining attorney's terminal 18. Upon receipt of the communication, the attorney's terminals 15, 16, 17 and 18 display the alphabetic an numeric text and provide a variety of tools for reviewing and evaluating what has been received.

Concurrent with receiving and displaying representative text, the examining terminals 15 and 17 provide a vehicle for the examining and associate examining attorneys to exchange messages. The message exchange occurs through the communication link 23. Similarly, message exchanges between the defending terminals 16 and 18 are provided for via the communication link 22.

As illustrated, the CAT system 13 may be positioned for easy access on a conference room table 25 in front of the court reporter. Also on the table 25, the terminals 15, 17, 16 and 18 may be found positioned in front of an examining attorney, associate examining attorney, defending attorney and associate defending attorney, respectively. Each of the terminals 15, 16, 17 and 18 and the CAT system 13 (the reporter's terminal) all include keyboards 27 and a screen 29. Each screen is flat or planar so as to prevent viewing by the opposing attorneys (and witness) who typically sit on either side of the table, as shown.

Although preferred, neither the keyboard nor the screen are necessary for the CAT system 13. In fact, the terminal itself, i.e., the functionality thereof, might exist within other nodes on the transcription network. For example, the functionality of CAT system 13 might be fully distributed between the recorder 11 and the examining attorney's terminal 15. The functionality might also be located at some remote, off-site location. Similarly, the present invention contemplates many situations where terminals are not available for each attorney present, such as where only the examining side uses terminals or visa versa, or where either side utilizes a single terminal. Situations may arise where no attorneys possess a terminal.

The use of additional terminals is also contemplated by the present invention. Although not shown in FIG. 1, a sixth terminal might be used: 1) for the witness who sits to the left of the defending attorney on the defending attorney's right side; 2) off-site for a magistrate or judge; 3) off-site for a client or expert witness; or 4) off or on site for additional attorneys on the case. These configurations and their advantages and features will be discussed in detailed below.

FIG. 2 is a schematic block diagram of an embodiment of the stenographic recorder 11 used to generate digitally coded stenographic signals representing the voice transcript. Recorder 11 includes a keyboard 31 which is actuated by the court reporter in response to the spoken word. The reporter strokes the keyboard by actuating either one key individually or several keys simultaneously to represent phonetically the spoken word or some phonetic portion thereof. Such a key-stroke causes print actuator 33 to print a single line of type representing that key-stroke onto a paper tape 35. Tape 35 is thereafter incrementally scrolled out in preparation for the next key-stroke.

During the deposition, the court reporter may be asked to read back testimony. The reporter does so by sequentially viewing each line of phonetic characters typed onto paper tape 35, mentally translating a single or several lines of characters into words, and then speaking out the phonetic translation. An attorney, judge or witness typically cannot read the phonetic characters typed onto tape 35.

In addition to being printed on paper tape 35, each key-stroke is converted to a digital code which is stored in a random access memory (RAM) 37 and also transmitted by a communication port 39 to CAT system 13 via a serial output bus 41 and communication link 19 (FIG. 1). If print actuator 33 runs out of tape 35, stenographic recorder 11 will still store and transmit the digital key-stroke codes. Similarly, if communication link 19 is disconnected, storage and printing will still occur.

More specifically, a keyboard scanner 43 monitors keyboard 31 via scanning bus 45. Upon each stroking of keyboard 31, keyboard scanner 35 responds by generating a digital key-stroke code representing all of the keys pressed in that stroke. Next, keyboard scanner 43 interrupts a central processing unit (“CPU”) 47 via an interrupt line 49. CPU 47 responds to the interrupt by placing the address of keyboard scanner 43 upon an address bus 51. In response, an address decoder 53 generates a select signal on a scanner select line 55 which enables keyboard scanner 43. CPU 47 then places a read enable signal on a read line 57. Keyboard scanner 43 responds by placing the digital key-stroke code on a data bus 59. Next, CPU 47 reads the key-stroke code into an internal key-stroke register 61 and increments a print count register 63, a transmit count register 65 and a memory pointer register 67. The count in register's 63 and 65 represents the back-log of key-stroke codes to be printed and transmitted, respectively. Immediately after the key-stroke code has been read, keyboard scanner 43 removes the interrupt and awaits the court reporter's next stroke.

Having retrieved the key-stroke code, CPU 47 must first save the code and then attempt to transmit and print it. Specifically, from the contents of memory pointer register 67, CPU 47 computes the next available address in RAM 37 and places that address onto address bus 51, while placing the contents of key-stroke register 61 onto data bus 59. In response to the address on address bus 51, address decoder 53 produces a select signal on RAM select line 69, selecting RAM 37. Afterwards, CPU 47 places a write enable signal upon a write line 71, and RAM 37 responds by storing the key-stroke code.

Having saved the key-stroke code, CPU 47 enters a prioritized wait-mode, awaiting an interrupt signal from: 1) keyboard scanner 43 along interrupt line 49, signifying that another key-stroke code has been detected and generated; 2) communication port 39 along interrupt line 73, signifying that port 39 is ready to receive a key-stroke code for transmitting; or 3) print actuator 33 along interrupt line 75, signifying that actuator 33 is ready to receive another key-stroke for printing. Interrupt servicing priority is given to scanner 43, followed by port 39 and actuator 33.

This servicing scheme does not provide first in first out servicing of interrupts. Instead, at the conclusion of servicing any interrupt, CPU 47 enters the prioritized wait-mode and, first, unmasks interrupt line 49 from scanner 43. If an interrupt is present, CPU 47 exits the wait-mode and services scanner 43. Otherwise, with no interrupt from scanner 43, CPU 47 examines the backlog count in registers 63 and 65. If neither contain a backlog, CPU 47 waits solely for an interrupt from scanner 43.

If one or both contain a backlog, priority is given to the register with the largest backlog or to transmit count register 65 if the registers are equal. With this priority, CPU 47 unmasks the corresponding interrupt line. If the corresponding device has provided an interrupt signal signifying that it is ready, CPU 47 will exit the wait-mode and service that device. Otherwise, if that device is not ready, CPU 47 will unmask the final interrupt line. At this point, CPU 47 will service the first interrupt that occurs. Finally, before servicing any interrupt, CPU 47 masks all interrupts.

Having described the prioritization scheme and the servicing of keyboard scanner 43, we move on to servicing communication port 39. As previously detailed, during the servicing of scanner 43, transmit count register 65 contains a backlog count of at least one, signifying the need to transmit. The count in register 65 directly represents the binary number of key-strokes that have been stored but not transmitted. If register 65 contains a binary value of one at the time CPU 47 responds with servicing, CPU 47 will place the address of communications port 39 onto address bus 51, while placing the contents of key-stroke register 61 (the recently received key-stroke code) onto data bus 59. Address decoder 53 responds by placing an enabling signal on port select line 77, selecting communications port 39. CPU 47 next places a write enable signal upon write line 71, causing the key-stroke code on data bus 59 to be written to communication port 39. Upon receiving the key-stroke code, port 39 removes the interrupt signal from interrupt line 48 until the key-stroke code is transmitted to CAT system 13 via communication link 19 (see FIG. 1). CPU 47 exits the servicing routine and enters the prioritized wait-mode.

If, however, transmit count register 65 reveals a greater backlog than just one, CPU 47 must retrieve the oldest un-sent key-stroke code from RAM 37 before sending it to communication port 39 as described above. To retrieve the oldest un-sent key-stroke code, CPU 47 calculates the address of the oldest key-stroke code by subtracting the contents of transmit count register 65 from memory pointer register 67 and placing that value on address bus 51. Address decoder 53 responds by selecting RAM 37 via RAM select line 69. Next, CPU 47 provides a read enable signal on read line 57 which causes RAM 37 to place the oldest un-sent key-stroke code onto data bus 59. CPU 47 then reads this key-stroke code into an auxiliary register 79, completing the retrieval. Finally, the same steps described above are engaged in by CPU 47 to write this key-stroke code (stored in auxiliary register 79) to communication port 39.

In an identical manner, responding to interrupt line 75, CPU 47 services print actuator 43 utilizing address decoder 53's selection of actuator 33 (via actuator select line 81) and either print count register 63 or auxiliary register 79 as necessary. The details therefore need not be repeated.

A read only memory (“ROM”) 83 provides the instructions or program for CPU 47 which defines the operation described above. This program also defines the initialization and mode of operation of stenographic recorder 11. ROM 83 is accessed much the same way RAM 37 is accessed, using ROM select line 85 from address decoder 81.

Additionally, although not shown, RAM 37 can be backed-up with batteries, providing a relatively permanent storage of the record for later retrieval. Otherwise, stenographic recorder 11 can service a disk drive system (not shown) which could either operate in parallel with RAM 37 but at a lower priority in the servicing scheme, or completely replace RAM 37. With either scenario, the disk drive system would create the permanent storage on a diskette.

As a final matter, while in the prioritizing wait-mode, CPU 47 can evaluate the backlog of the devices which are being serviced and read the status of a severely back-logged device to identify a problem. Warning messages can then be sounded or displayed to the court reporter. This might also be accomplished by using additional interrupt lines signaling malfunctions.

FIG. 3 is a detailed view of the interconnection of stenographic recorder 11 and CAT system 13. As previous described, in response to the court reporter stroking keyboard 31, recorder 11 creates a key-stroke code and transmits this to the CAT system 13 (the reporter's terminal) along communication link 19. Key-stroke codes received are displayed in two different formats on a screen 100 of CAT system 13. In a tape window 101 which is shaped and sized much like paper tape 35, alphabetic representations for received key-stroke codes are displayed in a line by line fashion wherein each key-stroke code is represented on a single line. Tape window 101 thus provides an electronic version of a paper tape which is generally in the format familiar to a court reporter.

Screen 100 also includes a transcription window 103 which displays the alphabetic and numeric text representative of the spoken words, i.e., displays the transcribed version of the key-stroke codes received from recorder 11. Through user selection, the windows 101 and 103 can be configured to update the screen 100 upon complete transcription of each: 1) word; 2) sentence; 3) Q & A; or 4) Q or A. The display can also be chosen to operate in either scrolling or paging fashion. For example, with a preferred display selection, the transcription window 103 will only display the current Q & A as it is developed on a sentence by sentence basis. After completing the transcription of the first sentence of a new question, the CAT system 13 clears the currently displayed Q & A (the old Q & A) from the window 103, and displays the transcribed alphabetic and numeric text of the first sentence of the new question. As each additional sentence of the new Q & A is received and transcribed, it too is added to the window 103. After fully transcribing the new Q & A, window 103 holds the display of that new Q & A until the end of the first sentence of the next Q & A; upon which, this cycle repeats.

To set up the mode of updating the windows or to access any of a variety of other CAT system 13 commands, the screen 100 also includes a command line 105. In one embodiment, the command line 105 lists several one-word commands displayed horizontally in a single line of text. The commands listed by the command line 105 can be selected directly using a mouse, pen, fingertip or other input device (none of which are shown) or indirectly by simultaneous pressing the “alt” key and the first letter of the desired command on the keyboard 107. Any other mode or method of command selection involving, for example, button bars, pull-down menuing, or icons might also be used.

As previously stated, a court reporter strokes the stenographic recorder 11 in response to the spoken word. Generally, a given word may demand either a single stroke or multiple strokes based on the complexity of its phonetic composition. Upon completing the strokes necessary for a given word, the court reporter immediately begins stroking the next word. No space bar or other word separator is stroked between words. As a result, all of the strokes run together. At the end of a sentence, the court reporter does stroke recorder 11, however. The CAT system 13 operates on this sequence of key-strokes to determine the beginning and end of each word.

Providing the foundation of the transcription process, the court reporter uses a personalized cross-reference library which lists the readable text of every word that the court reporter has translated to date. Each of these words are associated with the corresponding stenographic key-stroke or sequence of key-strokes that represents that word. This cross-reference library can be displayed in a visual manner, but is stored in binary tree structure for efficient searching.

Upon receiving the first key-stroke code of the sentence, CAT system 13 begins transcription. This first code is compared to all of the possible first stroke codes within the court reporter's cross-reference library. If a complete cross-referenced word is found, CAT system 13 preliminarily treats this as the entire word. Then, upon receiving the second key-stroke code in the sentence, CAT system 13 treats this second key-stroke code as the beginning of another word and looks again to all of the possible first stroke codes within the library. If that code is not found, CAT system 13 must back-track. The first and second codes are combined and are then compared to all of the possible first and second stroke codes within the library. If a complete cross-referenced word is found, CAT system 13 replaces the shorter, single stroke word with this new, two stroke word, and preliminarily treats this as the entire first word. Next, a third code is received and is similarly treated as the beginning of another word, and the cycle repeats.

By the end of the sentence, CAT system 13 provides the best transcription possible. Key-strokes that still remain untranscribed at that point are translated using a phoneme library into readable phoneme representations of those key-strokes to aid both the court reporter and, more importantly, those lay-persons down-line. By pronouncing these phonemes, the lay-person might identify the actual word or words which the cross-reference library did not contain. Substituting the phoneme representation with the actual word or words causes the cross-reference library to be updated.

Once fully transcribed using the cross-reference and phoneme libraries, the entire sentence is communicated down-line to attorney terminals 15 and 16 via communication links 21 and 20, respectively.

More specifically, FIG. 4 a is a diagram representing the association of data fields into a record which is used as a basis for the overall binary tree, data structure used in transcription according to the present invention. In particular, CAT system 13 utilizes a linked-list arrangement of two types of data records: a key-stroke code listing (KCL) record 151 and a corresponding text (CT) record 153. These two types of records provide the storage structure for the court reporter's personalized cross-referencing library.

Basically, CAT system 13 uses records 151 and 153 to associate each individual key-stroke code with as many subsequent key-stroke codes to represent a given transcribed word. KCL record 151 associates: 1) a listed key-stroke code (LKC) field 155, for storing a specific key-stroke code; 2) a listing optimizing counter (LOC) field 157, for storing a count used for optimizing future searches; 3) a first KCL_record pointer field 159, for storing a pointer to the next KCL record on this level; 4) a CT record pointer field 161, for storing a pointer to an associated CT record; and 5) a second KCL_record pointer field 163, for storing a pointer to a corresponding KCL_record at the next listing level down. Similarly, CT record 153 associates: 1) a CT string field 165, for storing a string of text; 2) a text optimizing counter (TOC) field 167, for storing an additional count used for optimizing transcription; and 3) a CT record homonym pointer field 169, for storing a pointer to another CT record containing a homonym to the contents of CT string field 165.

FIG. 4 b is a detailed diagram representing a portion of the overall binary, linked-list data structure used by CAT system 13 in transcription. KCL records 200, 201 and all KCL records (not shown) directly to the right and left of records 200, 201 constitute a first listing level. This first listing level is no more than a linked-list of the each beginning stroke in the stenographic representation of the words held in the court reporter's library. Upon receiving the first code of a sentence, that code is compared with each code stored in each KCL record on the first listing level. If, for example, the received code does not match the stored code in KCL record 200, KCL record 200 points to the next KCL record 201 for a similar comparison to the code stored therein. In this manner, by stepping through the first listing level, a matching KCL record can be found.

Assuming that the code stored in KCL record 200 does match the first key-stroke code received, the associated CT records 205 and 207 can be accessed to retrieve readable cross-referenced text. KCL record 200 can also be used as a back-tracking point. If the word transcribed requires two key-strokes, KCL record 200 points to the second listing level. This second listing level is specifically associated with KCL record 200 and begins with KCL records 202 and 203 followed by all KCL type records (not shown) to the right of record 203. Any second code received which follows a first code which matches that stored in KCL record 200 is compared to the codes stored in the KCL records on the second listing level. KCL record 204 represents yet a third listing level under the key-stroke sequence stored in record 200 and 203, and so on. CT records may or may not be associated with a given KCL record. KCL record 202 exemplifies such a situation. Additionally, only a single CT record is generally associated with a single KCL record, such as is shown with KCL record 203 and CT record 209. Only when homonyms exist will there be multiple CT record association as shown with KCL record 200 and CT records 205 and 207.

Particularly, upon receiving a first key-stroke code from stenographic recorder 11, CAT system 13 begins a transcription expedition by parsing through a first listing level in an attempt to find a matching KCL record. If a matching KCL record is found which has an associated CT record transcription, CAT system 13 decides there is a preliminary transcription and treats the second (next) code received as the beginning of a new word, parsing the first listing level for a matching KCL record.

If a matching KCL record is found for first code received yet without an associated CT record transcription, the second code received is compared for a matching KCL record at the second listing level, if a second level beneath the matching KCL record on the first level exists. If a match is found at the second listing level with an associated CT record transcription, the third key-stroke code is treated as the beginning of a new word, and parsing at the first listing level repeats.

If after a match with associated transcription text a key-stroke yields a dead end, back-tracking must occur. The requires picking up where the last successful parsing expedition left off, searching deeper listing levels to see if what had been considered an entire word is really only a portion thereof. If a match is found with an associated CT record transcription, that transcription at that deeper listing level is stored, and the next key-stroke code received is treated as the beginning of a word and the cycle repeats.

With each successive, unsuccessful parsing round, the previously described transcription process becomes more and more complex with many parallel and nested transcription pathways being considered. If available, the first completely transcribed sentence found is communicated to attorney terminals 15 and 16.

If a completely transcribed sentence is unavailable, a second phase of transcription, the phoneme transcription phase, is invoked. In this phase, as illustrated by FIG. 5 a, each untranscribed key-stroke is cross referenced into a phoneme replacement table or phoneme library 220 to obtain readable phoneme text. Specifically, the CAT system 13 parses through a series of listed key-stroke codes held in a listed_keystroke_code field 223 of the phoneme library 220 to locate the key-stroke code entry which corresponds to the untranscribed key-stroke code. Once located, the CAT system 13 extracts the readable phoneme text stored in a corresponding_phoneme field 221 of the phoneme library 220. Thereafter, extracted text is substituted for the untranscribed key-stroke in the sentence. Once all of the untranscribed key-strokes of the sentence have been replaced with corresponding phoneme text, the sentence is communicated down-line to the attorney terminals. For example, if the CAT system 13 receives a key-stroke code representing the key-stroke “W A Z” for which no counterpart exists in the cross-reference library, the CAT system 13 parses the listed_key-stroke_code fields 223 and locates the specific entry in a field 225. The corresponding readable phoneme text entry “was” in a field 227 is then extracted and substituted for the untranscribed key-stroke code in the sentence.

In addition, if homonyms are found, signifying multiple CT record association, they are displayed within parenthesis. For example, the homonyms “their”, “there” and “they're in the question “Were you there?” would be displayed as:

Q. Were you (there|their|they're)?

Although readable, to correct such common homonym transcription problems, another level of transcription is contemplated using so called “artificial intelligence” techniques for determining the correct word grammatically in the context of the given sentence.

As a final matter associated with the data structure, optimizing counters are placed in each record so as to provide optimization of the overall structure. Each time a record is used in the final translated sentence, the associated optimizing counters are incremented. At the end of a deposition, the court reporter can run an optimizing routine which sorts records in each linked list based on the count in the optimizing counters. In this way, the most common matching KCL records are encountered first, speeding up the parsing response time. Similarly, the most common homonyms will appear first in the parenthesis.

Referring back to FIG. 1, CAT system 13 communicates a sentence down-line to terminals 15,16,17,18 when that sentence has been fully transcribed. Full transcription takes place whenever CAT system 13 receives an end of sentence key-stroke code (“EOS”) from stenographic recorder 11. If, however, while in mid-sentence, the witness pauses to think, an EOS will not be received. To satisfy an attorney who wants to take this opportunity to review what has been stated thus far, an artificial EOS can be generated. An artificial EOS is generated if either: 1) the pause between the court reporter's key-strokes exceeds a preset time limit (programmable from zero to one hundred and twenty seconds); or 2) an attorney down-line presses the space bar on terminal 15, 16, 17 or 18 indicating via the corresponding communication links 20,21,22,23 the desire to display what has been received thus far. In response to an artificial EOS, CAT system 13 completes the translation as if an EOS has been received, attaches an artificial EOS to the end, and sends the transcribed “partial” sentence down-line. Thereafter, if the EOS is the very next key-stroke received, CAT system 13 merely communicates an EOS down-line to replace the artificial EOS. Alternatively, if further non-EOS key-strokes are received before receiving an EOS, CAT system 13 completes the transcription of the entire sentence with the EOS and sends it down-line. Terminals 15,16,17,18 over-write the “partial” sentence with the complete sentence received. In the alternative, terminals 15,16,17,18 might instead merely display partial sentences, only writing full sentences to file.

Depositions involving participants in two or more languages are also contemplated by the present invention. In a first embodiment, the transcription process operates as previously described by providing a word for word transcription of the spoken question in a first language. Each of these words are then cross-referenced into a second language table to generate a sentence in a second language. Although the use of the second language table can be assigned to each attorney terminal down-line, in a preferred mode, the CAT system 13 performs the task. Once translated, the CAT system 13 transmits the second language sentence down-line to viewing participants who understand that second language. By using additional language tables, the CAT system 13 can provide translations for as many languages as is needed.

If the CAT system 13 performs the translation task, the CAT system 13 also manages the specific down-line routing of first and second language sentences, automatically. To accommodate automatic routing, all participants are required to identify their viewing language and their role in the deposition upon logging in to the system 13. The CAT system 13 stores the identified information in a log-in list. By referencing the log-in list, the CAT system 13 automatically configures itself so as to provide each participant with a translation in their appropriate language.

An exemplary second or foreign language table 231 is illustrated in FIG. 5 b. Using the language table 231, the CAT system 13 parses the text entries of a first language in a series of identified_CT_String fields 233 to locate the transcribed text of a word in a first language. Once located, the text of a second language is extracted from a corresponding_translation field 231 and placed into a second language sentence. Each transcribed word in the first language undergoes this process so that a second language sentence can be fully constructed. Thereafter, both the first and second language sentences are transmitted down-line to the appropriate participant. For example, the word “good” located in a field 235 in a first language, English, would be directly substituted by the word “bueno” of a second language, Spanish, contained in field 237. Although translations would not always be exact because of word context, the use of a human translator could be minimized. Moreover, as a supplement to the language table 230, word context analysis might also be implemented to increase the accuracy of the translations.

If the down-line participants that speak during the deposition all share a common language, the court reporter need only understand the common language and provide a cross-reference library and a phoneme library for that language. Alternate language translations for non-speaking, down-line participants, are provided using as many different foreign language tables as needed, as described above. Routing and language information is retrieved from the log-in list.

If, however, the down-line participants that speak during the deposition share two or more languages, in a preferred embodiment, a multi-lingual court reporter provides a cross-reference library and phoneme library for each of the languages being spoken. Additionally, for each spoken language, language tables providing translation into each of the multiple languages are also necessary. During the translation process, when the multi-lingual court reporter indicates the beginning of a question or answer, or otherwise identifies the speaker, the CAT system 13 automatically selects the appropriate cross-reference and phoneme libraries to transcribe the spoken language, then accesses the appropriate foreign language tables to construct sentence translations. The resulting translations are automatically routed down-line. The CAT system 13 performs the automatic selection of libraries and tables and routing with reference to the entries in the log-in list.

Although the use of multiple phoneme libraries is preferred in a multiple language deposition, they may be completely ignored. Otherwise, when attempting to translate phoneme text into a second language, the phoneme text should be replaced by equivalent phoneme text in the second language. Therefore, pronunciation of the spoken word would sound the same in either language. Because with phoneme text the actual transcription of a word is unknown, a second language translation cannot be found. Therefore, as a substitute, a pronunciation for each of the participants in the phonemes of their language which sounds like the spoken word in the first language is provided.

Referring to FIG. 6 a, examining attorney terminal 15 includes a screen 251 and a keyboard 253. Screen 251 is split into a transcription window 255 and a communication window 257, having a common command line 259. The display of window 255 is very similar to that of window 103 in CAT System 13 (FIG. 3), displaying the transcribed text and having the same setup response choices. This transcribed text is received by terminal 15 from CAT System 13 in coded form along communication link 21 in a sentence by sentence fashion as described above.

Even though transcription is received on a sentence by sentence unit basis, in the preferred mode of response, the sentences units received are not displayed in window 255 until an entire question or responding entire answer has been received. Three indicators are used to signify the end of a question (EOQ) or end of an answer (EOA): 1) a court reporter's key-stroke signifying the beginning of an answer, clearly indicating an EOQ and vice versa; 2) the expiration of a preset delay (programmable from zero to one hundred and twenty seconds) since the last received sentence, providing an artificial EOQ or EOA; and 3) the examining attorney pressing a space bar 261, indicating a desire to see what has been received thus far and providing an artificial EOQ or EOA. Because the second and third indicators only signify that a question or an answer has possibly ended, they are updated upon receiving a true EOQ or EOA.

As illustrated in FIG. 6 b, the question begins in window 255 with the symbol “Q” followed by the question. For example, FIG. 6 b shows the display:

Q. When did that occur?

Afterwards, upon receiving the entire transcribed answer (signified by an EOA or an artificial EOA), the answer appears in window 255 directly below the question. The answer appears on window 255 following the symbol “A”. For example, FIG. 6 b shows the answer:

A. Around noon.

If the remaining room in window 255 is such that the Q & A cannot be simultaneously displayed, the answer is displayed at the bottom-most portion of window 255 while the remaining room is allocated to the end of the question.

Each Q & A remains displayed until the next transcribed question is received in its entirety. This is defined by either the receipt of an EOQ or an artificial EOQ. Thereupon, the last Q & A is cleared from window 255 and the entire next question is presented at one time, and the Q & A cycle repeats.

By permitting only the display of one Q & A unit at a time, transcription window 255 remains uncluttered, and the examining attorney can easily focus on the pending question and\or the last answer. Thus, for each successive Q & A unit, window 255 is cleared and then displays the full, transcribed text of the question followed by its corresponding answer.

In a preferred mode, each Q & A unit is associated with a different number so as to provide a reference to specific Q & A's. The numbers are given in consecutive order for ease of reference. As shown in FIG. 6 b, the Q & A displayed is associated with the number 57, signifying that the displayed Q & A is the 57th Q & A received. In its entirety, window 255 shows:

-   -   57 Q. When did that occur?     -   57A. Around noon.

Keyboard 253 includes an up arrow key 263 and a down arrow key 265 for scrolling through the Q & A's. For example, stroking the up arrow key 263 causes scrolling upwardly in a line-by-line fashion bringing the previous Q & A into display on window 255. For faster movement between Q & A's, keyboard 253 provides a page up key 267 and a page down key 269. In response to a single stroke, keys 267 or 269 cause paging between entire Q & A's. In addition, keyboard 253 includes a “home” key 270 and an “end” key 271. No matter which previous Q & A is currently being displayed, actuation of end key 271 causes window 255 to display either the last full Q & A or, if no answer has been completed, to the last pending Q. Similarly, home key 270 causes window 255 to display the first Q & A of the proceeding.

Keyboard 253 also includes a “mark” key 273 which permits any Q & A to be marked for later reference. The marked Q & A is indicated in the window by an asterisk symbol appearing on the screen to the left of the Q symbol as shown in FIG. 6 b. Associated therewith, keyboard 253 also includes a “quick-up” key 275 and a “quick-down” key 277 which permit scrolling only through the asterisked marked questions.

In particular, if an important answer or definition has been given by the witness, the examining attorney can mark the Q & A with an asterisk. Whenever the attorney wishes to refer back up to that answer, he or she may merely press quick-up key 275 to recall to the display window 255 the marked questioned with its answer. The attorney may also desire to refer back up to a Q & A marked several marked Q & A's ago. To accomplish this, successive strokes to quick-up key 275 pages backward through previously marked Q & A's. Although stroking end key 271 returns the display in window 255 to the pending question, the examining attorney may desire to sequentially page down through the marked Q & A's as they occurred. The quick-down key 277 provides this ability. In addition, marked Q & A's can be referred to after the deposition in preparation for the next day's deposition. As understood, pertinent questions may also be marked at a time outside of the deposition, for example, later that evening at the hotel room before the next day's deposition.

Accessing a specific Q & A can be achieved through the associated number assigned to that Q & A. By pressing a “number” key 278 followed by the number previously associated with a specific Q & A, terminal 15 locates and displays that Q & A in window 255. A specific Q & A can also be located through a lexical scanning capability accessible through a scan-up key 279 and a scan-down key 281. For example, by pressing scan up key 279 followed by a lexical searching template, attorney terminal 15 scans backward through the previous Q & A's in an attempt to find text matching the lexical template. If a match is found, that Q & A is displayed on window 255, else a “no match found” message is received. Afterwards, stroking scan-up key 279 again pulls up the most recently used lexical template. By merely stroking scan-up key 279 without modification to the template continues the search further upward through previous Q & A's. Scan down key 281 works in a similar manner but scanning in the opposite direction.

In general, the lexical template format and features are similar to those provided in legal searching databases familiar to most attorneys. These templates can be constructed using logical operators, wild card characters, quotes, parenthesis and brackets with specific text to be located. The lexical template ordinarily must be satisfied by a single Q & A. To over-ride this “default” condition, a delimiter must be used. For example, placing a “/A” or a “/Q” delimiter within the template, directs attorney terminal 15 to satisfy that template, or portion thereof, entirely within a single answer or a single question, respectively. A “/QA2” delimiter expands the search to within two consecutive Q & A's; while a “/QA3” corresponds to within three consecutive Q & A's, and so on. Similarly, a delimiter of “/A3” or a delimiter of “/Q5” respectively require satisfaction of the lexical template to within three consecutive answers or within five consecutive questions. Delimiters “/W4”, “/W5Q”, or “/W3A” require the respective satisfaction within: 1) four consecutive words in either a single Q or A; 2) five consecutive words in a question only; or 3) three consecutive words in an answer only.

In addition to searching only the current transcript, lexical searching can also be applied to all other evidence in the lawsuit. The terminal 15 permits either locally or remotely stored transcripts recorded at previous hearings or depositions to be selected and searched through the command line 259. Similarly, all other case evidence such as the pleadings documents or documents produced through discovery may be scanned-in, converted to text using character recognition software, and made available for the lexical searching. Once located through a search, the scanned-in image can be retrieved and displayed or printed for use as an exhibit at the ongoing deposition.

Remotely located legal databases such as Lexis® or Westlaw® can also be accessed and searched through the command line 259 during the deposition. Searching other sources may be used to identify discrepancies in the witness's current testimony, or to help clarify the law in an unexpected area so that appropriate questioning can be directed.

Although boolean logic is used for formulating lexical search requests, a natural language front end is also contemplated. The attorney terminals could provide for user preference, invoking the boolean logic or natural language front end upon request. The lexical formulation for a search is identical whether searching any of the case evidence or legal databases, thus providing uniformity of searching.

The specific placement of the various keys shown on keyboard 253 is not critical. In fact, the illustrative placement in FIG. 6 b is merely set forth for ease of explanation. Additionally, the functionality of these keys may be accessed directly from the command line 259 as explained above.

Pressing a tab key 285, or through direct selection with a mouse, pen or finger-tip, causes toggling between transcription window 255 and communication window 257. The communication window 257 provides a visual display in either a stack mode and an edit mode. As illustrated, in the stack mode, communication window 257 displays the first line of each communication received. These communications originate from three sources: 1) the examining attorney's keyboard 253—personal notes “From: Self” or messages “To:” others; 2) the examining attorney terminal 15—through artificial intelligence (“AI”) algorithms “From: AI” which convey potential objections or through scheduler messages; and 3) the other terminals on the network such as the associate examining attorney terminal 17—messages “From:” others. While in the stack mode, window 257 can display all of the communications from one source, all sources, or any combination of the three sources.

To fully read any communication, the examining attorney merely selects the desired communication from the stack of messages illustrated in the window 257, causing the window 257 to display of the full text of the communication in an edit mode. Communications can also be created, modified, deleted, copied, printed, or communicated to the other terminals, such as associate examining attorney terminal 17 via the communication window 257.

In this embodiment, one side's message exchange along that side's communication link cannot be intercepted by the opposing attorney's not only because the communication links are isolated, but also because of the controls placed on the CAT system 13. The CAT system 13 can only consider relaying communications from one side to the other if a specific request therefor is received. Moreover, even if a specific request is received, the CAT system 13 cannot relay communications unless specific permission has been given to receive such communications. In particular, an attorney on one side sends a communication to the CAT system 13 with a specific request that the communication be relayed a recipient on the opposing side. Once received, the CAT system 13 complies to the request only if the recipient on the opposing side grants permission for the relaying. To grant or retract permission, each side sends a message to the CAT system 13, at log-in or at any subsequent time, indicating whether or not requests to relay messages from the other side should be honored. If not, the CAT system 13 responds to each request by sending a do-not-disturb (DND) message to the requesting terminal. If so, the CAT system 13 responds by relaying the message from the sending side's communication link to the receiving side's link.

In either the edit or stack display modes, communication window 257 interfaces with a mouse, keyboard 253 and associated keys in much the same way as does transcription window 255. For example, up and down arrow keys 263 and 265 and page up and down keys 267 and 269 respectively permit scrolling and paging through the stack or text. However, the scan-up and scan-down keys 279 and 281 operate in a slightly different way. In particular, as a default, lexical scanning only parses the currently displayed stack, i.e., the currently selected communication sources. A global command delimiter “/G” overrides the default so that all communications from all sources can be scanned. The following use of delimiters can override the default: a “/M” for messages only; “/N” for notes only; “/AI” for AI objections only; or for combinations of these fields—for example “/MN” for messages and notes only.

The examining attorney may also choose to modify a significance indicator, preferably a fuzzy-logic type slide-bar icon, which causes the attorney terminals to associate a level of priority with each message, note, or other communication received. The significance indicator may default at an average or middle setting and need not be adjusted. If selected as a set-up option, the significance indicator can be used, along with the age of the communication, to adjust the order of the stack of communications. The significance indicator can also be used to associate a window or window border color with the level of priority the communication carries. When set to a maximum value the significance indicator can also be used to send a flashing message to another terminal, indicating immediate attention is desired. This functionality exists on each of the terminals in the transcription network.

The examining attorney may also type to himself his own notes as the deposition is progressing. These notes may serve useful in that they may be reviewed later that evening in order to prepare for the next day of deposition, or they may be used at a later time during the deposition when a question becomes ripe. Most attorneys take notes during a deposition. Those who type proficiently may take notes with keyboard 253, and those who do not might use pen-based input as described more fully below in relation to FIG. 13.

Alternatively, the examining attorney may type or pen notes into his terminal before the deposition begins. In particular, the attorney terminals 15 and 17 provide an outline mode for hierarchically outlining by subject category and sub-category a series of ordered questions to ask the witness or to act as a reminder as to certain areas of questioning that must be covered during the examination.

Referring to FIG. 7 a, while off-line, upon entering the outline mode, the examining attorney uses the attorney terminal 15 to create a conventional outline of the names of categories and subcategories to be covered at a deposition using an outline window 285. As illustrated, an area of law such as “patent law” is entered as a hierarchical entry point by assigning it a Roman numeral as is in typical outlines. Each subcategory of patent law such as infringement, invalidity, laches, etc., is then entered thereunder. Similarly, the attorney continues to enter sub-subcategories and so on until all of the categories of patent law that need to be discovered are in the outline. Next, the attorney enters questions or notes under the appropriate categories in the outline. For example, under “best mode” the attorney may enter the question: “Did you disclose the most recent version of your invention in the patent application?” After all of the questions (or notes) have been entered, the attorney then estimates the amount of time that should be spent on each category. This time is entered through a command line 283 for use by a scheduler (described below in more detail).

After entering the entire outline, questions and estimated times, the attorney invokes a toggle command on the command line 283 which causes the outline structure to enter the graphical tree-structured display mode illustrated in FIG. 7 b. In this mode, the outline window 285 displays the created outline in a tree-structured format. Each category and subcategory can be easily discerned as well as the total estimated time to finish each hierarchical level. For example, in a block 287, the area of law, patent law, can be found to have an estimated time of eight (8:00) hours to complete. This time is comprised of one hour and thirty minutes (1:30) to complete “laches” at a block 289, five hours (5:00) to complete “invalidity” at a block 291, and one hour and thirty minutes (1:30) to complete “willfulness” at a block 293. In the preferred embodiment, the underlying questions or notes associated with each category or sub-category are not displayed in the tree-structured display. Toggling between the standard outline display and the tree-structured display can be repeated until preparation for the deposition is complete.

In addition to generating entire outlines from scratch, standard, reusable outlines pertaining to specific areas of law may be effectively utilized as a default. Once the outline is created, it may be re-used in subsequent depositions by merely adding or deleting questions where applicable.

Pretyped questions are used during the deposition by both the examining and defending attorneys to direct the proceedings. In addition, notes can be added detailing the current state of the law in each subcategory for reference during a deposition.

Referring back to FIG. 6 a, while in the outline mode during a deposition, the configuration of windows on screen 251 of the examining attorney's terminal 15 is identical to that described above with one exception. Instead of displaying Q & A's, the window 255 is used for an outline mode display. Particularly, whenever an outline command is selected from the command line 259, the window 255 displays the tree-structured outline illustrated in FIG. 7 b. From this outline, the attorney may choose to: 1) toggle to the standard outline, illustrated in FIG. 7 a, to update the outline; 2) evaluate and, if necessary, change the time estimates based on the deposition progression; or 3) select a category of questions and notes for immediate use.

By double-clicking the mouse button on the desired category (or through command selection via the command line 259), the pretyped questions and notes relating to the selected category are assembled onto a stack and displayed in the communication window 257 for immediate use. Upon user setup selection, the pretyped communications can be pushed onto the stack or be placed into a new independent stack which is solely displayed on the window 257. With the later selection, an individual stack is constructed for each and every subcategory in the outline.

As with off-line preparation, during the deposition, notes may be modified or added. Similarly, new categories may be added so that existing notes may be placed thereunder. To move an existing note, the note need only be selected via the window 257 and transferred by selecting a category via the window 256.

In an alternate embodiment, color coding is used to aid the identification of areas of remaining inquiry. For example: a grey category name indicates that all of the notes thereunder have been addressed; a yellow category name indicates that all of the notes thereunder need to be addressed; and a blue category name indicates that some of the notes thereunder need to be addressed. Similar color coding can be used for the specific notes. Colors can change automatically, upon detecting that all notes have been used, or manually by the attorney upon exhausting an inquiry without using all the questions or notes.

In addition, the terminals 15 and 17 record the range of the Q & A numbers which occur while within each specific category or subcategory. The range of Q & A's recorded can thereafter be used to aid in the review of areas of inquiry which are scattered throughout a transcript.

Objections to form must be seasonable, therefore timing is of utmost importance. To speed up the objection process, AI software routines analyze the form of each question and the content of the answer during and immediately after translation to provide various potential objections that the examining attorney may want to take into consideration in attempting to achieving good evidentiary form. For example, a search is made on each question for phrases such as “you said” or “you stated” or “you say”. If found, the AI routines immediately send a potential objection to the communication windows indicating that the question possibly mischaracterizes the witnesses earlier testimony. In response to the communication, the defending attorney may object, and may also initiate a search for the mischaracterized statement by opening the communication and pressing the scan up key 279. In this way, mischaracterizations may be avoided or corrected. Similarly, for example, answers are analyzed with focus on words such as “said” and “stated” to identify potential hearsay therein. The examining attorney may choose to object or ignore the communication.

As illustrated in FIG. 8, associate examining attorney terminal 17 includes a keyboard 301 having alphabetical keys arranged in a standard typing keyboard key pattern. As with the examining attorney's terminal 15, the associate examining attorney terminal 17 includes a screen 303 which is split into a transcription window 305 and a communication window 307 which have a common command line 309. While in the deposition mode, the terminal 17 displays the alphabetic and numeric text representing the spoken words in the transcription window 305, and the window 307 provides for input and display of messages, notes, etc. In the outline mode, the window 305 performs the hierarchical outlining functions described more fully above in relation to the terminal 15.

The associate attorney may also send messages to the examining attorney as the deposition is being carried out. The functionality of such communication is identical to that described previously regarding communication in the opposite direction with one enhancement. The associate's terminal 17 provides the capability of viewing, modifying, and sorting the communication currently present on the attorney terminal 15. Such capability allows the associate attorney to help organize the terminal 15, which has added importance for an examining attorney with little computer literacy.

AI objections are always associated with specific questions or answers. Messages may be associated upon selection via the command lines so that the recipient can place the question in context. Notes can be similarly associated. Important messages, notes, or objections can be recorded in a supplemental file by using the annotate command from the command line. Annotating creates a file of annotations which may or may not be associated with a specific Q & A, which may be stored with the transcript for later retrieval and review.

A demeanor marker command also provides for annotating a specific Q & A. Noticing that the witness turns red or begins to stutter, fidget, or raise their voice in response to a specific question, an attorney can select the demeanor command which presents a pretyped list of demeanor indications such as “fidgeting” or those listed which may be immediately associated with the current Q & A and stored as an annotation to the transcript. Similarly, a conferral command can be selected to annotate that the witness and opposing attorney conferred off the record regarding an associated Q & A. Additionally, although not generally useful during the deposition because they may be overheard, voice recorded notes or annotations may also be added by an attorney reviewing the transcript away from the deposition. Such annotations may be selectively displayed or played back via the attorney terminal's command line.

In the communication windows of each terminal, a stack containing a series of common pretyped messages (“common message stack”) can be quickly accessed for transmittal. For example, in the communication window 307 of the terminal 17, a series of common messages such as “What was the date, time, year?”, “Were there any witnesses to the event?”, or “What is his name, address, employer?” can be pretyped and listed in a message stack format in the window 307. The associate attorney need only select the desired pretyped message from the stack and select the send command. Similarly, potential objection messages can be added for quick access to the common message stack in the communication window of the associate defending attorney's terminal 18. The common message stack at each terminal can be customized by editing, adding or deleting messages therefrom.

A time command is also available for the command windows of the attorney terminals. Executing the time command enables the attorney to identify delay times between the end of a specific question and the beginning of the witness's answer. Similarly, using the time command, witness and opposing attorney conferral time can be recorded. Underlying this command, a system timing routine operates to log event timing in a separate data file which the time command can access for specific requests.

The system timing routine also operates as an alarm clock for scheduling a deposition. Referring back to FIGS. 7 a and 7 b, using the alarm command while preparing a hierarchical outline for the deposition, the attorney may allocate a projected amount of time for each category or subcategory. The projections can be monitored by the system timing routine, and indicate to the attorney through a system communication that projections are not being held. Specifically, while under a particular category, the system timing routine begins to count down the time allocated to the subcategory as described above. If the attorney completes the category before time runs out, the extra time will be added to a buffer. If instead time runs out, the alarm clock sends a communication to the attorney indicating that the projected time has expired and indicates the buffer status. The buffer is periodically updated by subtracting the time which lapses beyond the projected value. If after receiving the system communication the attorney remains in a category past the projected time period, the alarm clock sends a new system communication every thirty (30) minutes (as a default value).

As the deposition progresses, whenever the attorney enters the tree-structured outline mode, the time value displayed for each category, as illustrated in FIG. 7 b, reflects the time remaining in each specific category. The time value for the category currently selected (and its parents in the tree-structure) continually updates so as to display the scheduler's minute by minute count down. Rescheduling or other time value changes are automatically reflected throughout the entire tree-structured display, enabling better time management of the deposition.

When a confidential restriction is placed on specific subject matter of a deposition, the examining attorney often forgets to remove the restriction when the subject has changed. By selecting the alarm clock command and then the subsequently listed confidentiality wake-up command, the terminal 15 automatically sends a system communication every fifteen minutes (can be changed through a set up screen) to remind the examining attorney to clear the restriction from the transcript if possible. In addition, upon selecting a different category or subcategory in the hierarchical outline, a system communication is also immediately transmitted as a reminder. The confidentiality command also provides a mechanism for disabling and enabling local network communication from reaching off-site client terminals, unless specifically directed thereto.

In one embodiment of the present invention, translated sentences are communicated from CAT system 13 via serial communication along communication link 21 to terminal 15 and communication link 23 to terminal 17. Communications between terminals 15 and 17 utilize the two independent communication links 21 and 23. In another embodiment, the communication links 21 and 23 are interconnected and function as a single communication link (bus) using a TCP-IP protocol on an ethernet coaxial cable link using an ethernet board in each terminal. Two ethernet boards are desirable in the reporter terminal so that isolation can be maintained between the opposing sides. A software package “Berkeley Sockets”, developed at the University of California, Berkeley, is the preferred software base for maintaining the communication flow along the link. Packages implementing “Berkeley Sockets” are available through various third party vendors, including for example FTP Systems, Inc., Burlington, Mass. The communication links 20 and 22 are interconnected and function in a similar manner via the second ethernet board in the reporter terminal. Other hard-wired or wireless communication may also be implemented.

Two issues of isolation must be addressed by the communication links in the present invention. First, the court reporter stores all of the keystrokes pressed and all of the transcribed alphabetic and numeric text in the CAT system 13. These files must not be modifiable by any terminal on any down-line communication link. Such modifications would corrupt these files and the transcript would not be certifiable. Therefore, the communication out of CAT system 13 should only be that necessary to transmit copies of the stored files. More importantly, communication into CAT system 13 should only be that necessary to request transmission. Although additional communication features into and out of CAT system 13 may be added, they are not preferred.

Second, isolation must be established between the defending and examining terminals. Under no circumstances can the messages between the examining terminals be intercepted by the defending terminals. This requires that the communication links must be isolated. Isolation may be accomplished by using an independent hard-wired communication link as is illustrated in FIG. 1 via a serial or coaxial link, for example. Encryption of the communication exchanged may be used as a safeguard. However, if a wireless link is used, encryption becomes a requirement. Although various encryption schemes are contemplated, a preferred scheme involves the use of direct-sequence spread spectrum transmission. Such transceivers are relatively inexpensive and, by their very nature, provide encryption. Other exemplary communication links which might be implemented include token ring, parallel, infrared, RF (radio frequency), etc.

FIG. 9 illustrates additional features of the present invention which the court reporter and attorneys may utilize. As stated previously, the court reporter usually tapes the entire deposition or trial as a back up to the stenographic recording, and uses the audio tape to help in translating the proceedings. To avoid clumsy and time consuming searching through the audio tape to locate Q & A's, a synchronization scheme has been added to electronically associate each stenographically recorded Q & A with the actual position on the audio tape of the audio of the Q & A. With synchronization, the court reporter can quickly and easily locate audio on the tape.

Specifically, a tape recorder 351 is attached via a communication link 353 to the reporter terminal 13. Upon receiving an entire Q & A, the reporter terminal 13 requests and receives a position indication from the tape recorder 351. The marker position is generated by the tape recorder by a tape length counter 355 and associated circuitry. The reporter terminal 13 stores and associates each such position indication with the corresponding keystrokes. Thereafter, when necessary, the court reporter can display the position indicator and quickly locate the desired audio manually. Alternatively, communication link 353 can be used by reporter terminal 13 to communicate control signals to the tape recorder 351 so as to automatically position and play-back previously recorded audio upon request.

Instead of receiving position indications from the tape recorder 351, the reporter terminal 13 may send an audio signal representing an upcoming Q & A number to recorder 351 for recording. This audio signal may be either a voice-synthesized indication of the number count, aiding a manual search, or a digital data stream that is detectable by the recorder 351 using low or high speed searching directed via terminal 13.

Similarly, a video camera 361 may also be used to video tape of proceedings. Although video camera 361 may be able to independently store and play audio tapes, a separate video recorder, video recorder 363, is illustrated as being connected via a communication link 365 for this purpose. As with the audio recorder, video recorder 367 can be used in a synchronization scheme for associating and accessing video events with the transcribed text through communication link 369. A television monitor 371 provides for playing back recorded video information.

If the video or audio system is fully automated through the control of reporter terminal 13, upon a request from a judge or attorney to read back a Q & A, the court reporter may play back the associated audio and video by merely locating the requested Q & A on reporter terminal 13, possibly through a lexical search, and selecting a play-back command.

Also contemplated is the use object-oriented association of recordings using video frame grabbers and sound grabbers resident in reporter terminal 13 which digitally sample, store, associate and synchronize the video and audio of a proceeding. Synchronization would take place using a direct association of the transcribed text in a common programming object with the audio and video frames.

Using position synchronization for locating and playing audio and video also proves to be a valuable tool for the attorney. If the attorney desires to play portions of the audio or the video, for example, during trial, the attorney may use the lexical searching capabilities of the attorney terminals to locate the desired Q & A's, and may then automatically play the associated audio or video back for the judge and jury.

Referring to FIG. 10, a court room scenario is shown using the present invention. A first chair, examining attorney 401 holds a portable computer tablet 403 which has a wireless transceiver 405 attached thereto. The tablet 403 may provide pen-based or touch sensitive input and acts as a direct substitute to the attorney's paper note pad. In one embodiment, the wireless communication link is established using RF (radio frequency), spread-spectrum transmissions for the inherent data encryption associated therewith. As stated previously, other types of RF, ultrasonic or infrared transmission and data encryption schemes might also be used.

To guarantee isolation of the court reporter's transcription files, the terminal 13 only manages outgoing communication and requests therefore as described previously in relation to the configuration of FIG. 1. As explained above, although such communication exchanges involving the reporter terminal 13 do not require encryption, secure communications between the attorney terminals must exist to prevent interception by the opposing side (not shown). As illustrated, secure communication is accomplished between the terminals 15 and 17 using the independent hard-wired communication link 23. Encryption using a pseudo-random, secret spreading code common only to transceivers 405 and 409 implementing direct-sequence spread spectrum transmission secures the communication link between the attorney terminal 403 and either of the terminals 15 and 17. Other encryption schemes may also be used.

Wireless transmission may also be used by additional wireless transceivers (not shown) for use by additional on-site attorneys staffing the lawsuit. A communication line 411 may be used to establish a communication link with additional terminals used by attorneys, client's, or experts who are not in the court room, i.e. are remotely located off-site. Although the communication line 411 is a telephone line attaching to an internal modem of terminal 17, other communication methods such as a cellular RF link might be used. The communication line 411 provides for the active participation of off-site persons who might otherwise be unable, because of cost or time, to attend.

The local attorney terminals 403, 15, and 17 can control the flow of data through communication line 411. For example, if confidential information is being described, the attorney terminal 15 can block transmission off-site to a client by disabling outgoing communication via the off-site link via a command on the control line 259.

Together, the transceivers 407, 405, and 409 and the links 23 and 411 form a single communication link between the terminals. Channel isolation exists between the examining attorneys. File isolation exists at the CAT terminal 13. The characteristics of this single, overall communication link is transparent to the attorneys operating the terminals, they need only choose a destination and can send messages. Although not shown, the opposing attorneys also possess a similar communication system on a second, single communication link.

More specifically, the first chair, attorney 401, uses portable tablet 403 to record notes, receive translated Q & A's and exchange messages with the second chair at terminal 15. The first chair can either direct the message to one of the supporting attorneys at a terminal or globally send messages to all available attorneys at all of the terminals on the link. Similarly, messages can be received from such terminals. Although not necessary, a communication pathway hierarchy is preferred wherein all messages directed between the first chair and lower numbered chairs are intercepted by the second chair terminal for screening and redirection. In this way, the second chair attorney prevents the first chair attorney from being distracted by unnecessary, unimportant, or misdirected messages. Additional levels of hierarchy between the second and lower numbered chairs may also be implemented.

A terminal such as portable tablet 403 or attorney terminal 15 might also be used by the judge during and after trial for reviewing and analyzing the lawsuit. When facing a ruling on the matter, the judge might also use the features described above to determine whether proper evidentiary form was achieved in the Q & A. Similarly, for review and analysis, the jury might be provided such terminals during trial or deliberation to help reach their decisions. Especially beneficial in either situation would be the ability for audio and/or video review as described in relation to FIG. 9 above.

FIG. 11 illustrates an alternate embodiment wherein, instead of responding indirectly to stenographic keystroke translation, the reporter terminal 13 directly performs voice transcription and association. Added to the reporter terminal 13, a microphone 451 generates an analog voice signal representative of the words spoken during a proceeding. The analog voice signal passes along a communication link 453 to an A/D (analog to digital) converter board housed within reporter terminal 13. This A/D converter samples and digitally converts the analog voice signal to a series of digital values representative of the words spoken at the proceeding. Linear predictive coding (“LPC”) techniques are then used either remotely (through an off-site computer link) or locally (within reporter terminal 13) to transcribe the deposition.

Additionally, the reporter terminal 13 may also use the stenographic keystroke input as a basis for training the voice transcription system. Specifically, the reporter terminal 13 breaks the digital voice signal into segments, and, using LPC techniques, synchronizes through association groups of the segments with corresponding keystrokes from the stenographic recorder 11. The process of associating the segmented digital voice signal with the corresponding stenographic keystrokes provides a method for training the voice transcription of the reporter terminal 13. Once trained, voice transcription may more effectively supersede the stenographic recorder 11 in generating a written transcript of the proceeding.

FIG. 12 is used to illustrate two alternate embodiments where the court reporter is not linked to the transcription network. In a first embodiment, voice transcription is directly managed by an attorney terminal 555 using a microphone 551 via a line 559. Voice transcription can occur either locally, within terminal 555, or remotely via a communication link 565. Additionally, as described previously, transcription, association, synchronization, and message exchanges would occur between terminal 555 and an associates terminal 563.

A second embodiment is illustrated wherein terminals 555 and 563 provide for message exchange and message association with the corresponding recorded audio on the tape, transcription not being available. Message association would occur using by either retrieving and storing the position indication from recorder 553, or sending for recording, audio or digital signals for manual of automated play back. A more detailed description of the functionality of such message association and play-back can be found above.

FIG. 13 is a perspective view of another embodiment of the present invention. Instead of using a keyboard, the attorney terminals 15 and 17 might adopt pen-based or touch-sensitive screen input. Pens 601 and 603 and the screens 251 and 303 provide an attorney a substitute for a pad and Post-it® brand notes. The associate attorney may open a computerized representation of the Post-it® brand note (a “computer note”), scribble a message thereon using the pen 603, and press the send command on the command line 309. Unlike actual Post-it® brand notes, computer notes can be expanded. Each expanded computer note is displayed in the default size yet has an associated scroll-bar and expand and reduce buttons that can be manipulated so that an oversized message can be fully read.

The terminal 17 directly sends the entire bit-mapped image of the computer note to the attorney terminal 15 for display in a tiled format (cascading may also be selected as a default) on the screen 251. The attorney operating the terminal 15 may then modify, delete, move, etc., the received computer note using the pen 601 and the commands from the command line 259. Message exchanges in the opposite direction and note taking, as previously described, works in a similar manner. Moreover, instead of sending a bit-map of the computer note, handwriting recognition software may be used to convert the note to a text format to make the notes more legible to the recipient. The recognition software may be selected directly through command line 259 or through a set-up default.

The transcription network of FIG. 13 is illustrated to operate without transcription input; however, if present, transcribed text can also be displayed by opening transcription windows as described above.

As with the previous embodiments, the associates terminal 17 has the ability to display and manipulate the messages displayed on the attorney terminal 15. For example, the associate can: 1) delete computer notes that have been used or become stale; 2) replace incomplete or unrecognizable notes; 3) rearrange the tiled or cascaded notes to better communicate priority or to organize subject matter; or 4) modify the significance indicator causing message border colors to change or flash (described more fully above) on the attorney terminal 15 indicating message priority.

FIG. 14 illustrates a flow-chart describing the communication protocol used by the attorney terminals for communicating with the reporter terminal (or CAT system) 21. Ordinarily, at the completion of a transcription of a full Q&A, the reporter terminal 13 transmits the Q&A down-line to along communication links 21 and 20, for example. If the attorney terminals, such as the terminals 15, 17, 16 and 18, are listening, they receive and update their screen displays appropriately. A specific protocol has been implemented to accommodate: 1) requests by the attorneys for incomplete, preliminary translations; 2) attorney terminals which have not “heard” all of the Q&A transmissions; and 3) screening of Q&A's by a senior chaired attorney to those down-line such as remote, client terminals.

Specifically, as represented by a begin block 651, upon connecting to the communication link with the reporter terminal, the attorney terminals initiate the communication protocol. At a block 653, the attorney terminals construct a log-in packet, a packet of bytes of digital information, which consists of: 1) a source identifier, identifying itself as the source of the communication; 2) a destination identifier, identifying the reporter terminal as the destination; and 3) a pointer indicating the attorney terminal's current position, i.e., the pointer identifies the last Q&A that the attorney terminal has received from the reporter terminal.

The pointer to the last or most recent Q&A is referred to hereafter as a “self.current” pointer. If an attorney terminal is logging in for the first time in a deposition, the value of the “self.current” pointer is equal to zero (0) to indicate that no Q&A's have yet been received. Likewise, if the attorney terminal is logging in a second or subsequent time (because of either an accidental or purposeful disconnection), the “self.current” pointer is set to the last Q&A number received prior to the disconnection.

As represented by a block 655, the attorney terminal sends the log-in packet to the reporter terminal, and waits for a response. Upon receiving the log-in packet, the reporter terminal enters the source identifier into a log-in list, stores the “self.current” pointer, and transmits a responding packet containing a source identifier, destination identifier, and the number of the most recently translated Q&A. The most recent Q&A number is maintained by the reporter terminal's “rptr.current” pointer.

Upon receipt of the response packet, as illustrated by a block 657, the attorney terminal first determines the number of outstanding packets needed. This is done by subtracting the attorney terminal's “self.current” pointer value from the received, reporter terminal's “rptr.current” pointer value. The resulting difference (the number of needed Q&A's) is then compared to a threshold value. If the difference is less than the threshold value (a “yes” to the decision at block 657), the attorney terminal builds an update request packet, containing source, destination and update indicators, and transmits the packet as represented by blocks 661 and 663, respectively. In this instance, the update request packet is very similar to the log-in packet, containing: 1) a source identifier; 2) destination identifier; and 3) pointer to the attorney terminal's current position.

If the difference is greater than the threshold value (a “no” at block 657), the attorney terminal prompts the attorney using the attorney terminal for the range of Q&A's that are actually necessary, as represented by a block 659. The range is selected by choosing the earliest Q&A desired. The range would then be from the earliest Q&A to the most current Q&A as determined by the reporter terminal. The prompting for a range enables the attorney to evaluate the number of outstanding Q&A's that have not been received, and request only those that are truly needed. The thresholding prevents large deposition file downloading to late-coming attorneys where unnecessary. Thereafter, at the block 661, the attorney terminal constructs an update request packet using the selected range, and transmits it to the reporter terminal at the block 663. In this instance, the update request packet contains: 1) a source identifier; 2) destination identifier; and 3) the number of the earliest Q&A desired.

At block 663, after transmitting the update request, the attorney terminal awaits an update. The reporter terminal responds by sending the requested range. However, if the requesting attorney terminal or a higher chaired attorney has placed a Q, A or Q&A restriction upon the requesting terminal for all or part of the requested range of Q&A's, the reporter terminal transmits substituted text, such as “Restricted viewing!” instead of the actual text translation where necessary. The reporter terminal tracks restrictions in the log-in table. Similarly, whenever a Q&A translation is complete, the reporter terminal only transmits the text of the translation to those attorney terminals having no restrictions placed thereon. To those attorney terminals with restrictions placed thereon, the substitute text is transmitted.

Upon receiving the requested Q&A's or substitute text therefor, as represented by a block 665, the attorney terminal stores the requested Q&A's and updates its “self.current” pointer. At this point, the attorney terminal is considered up to date. Thereafter, both periodically and upon an attorney's direct request (in response to specific keyboard or mouse input), the attorney terminal constructs an update packet, at a block 667, and repeats the cycle beginning with block 655. This looping cycle ensures that the attorney terminal stays up to date, even if the attorney terminal for any reason temporarily stops monitoring the communication link during the deposition.

Whenever a request packet is received which contains a “self.current” pointer which is equal to the reporter terminal's “rptr.current” pointer, the current Q&A pointed to by the “rptr.current” pointer is resent along with any partial translation for the next Q&A that has been completed. The current Q&A is always resent to make sure that the attorney terminal possesses the full translation thereof (in case only a partial translation is currently stored). The transmission of the partial translation of the subsequent Q&A allows the specifically requesting attorney to display the ongoing translation.

The value of the threshold depends on nature of the communication network and user preference. If the network is sluggish, the threshold should be set lower to off-load competing communication on the channel. To accommodate each user's preference, control of the threshold value is be placed at each attorney terminal.

In addition, although the thresholding decision is made by the attorney terminal as represented by the block 657′, in an alternate embodiment, the reporter terminal makes the decision. Specifically, referring to FIG. 15, a flow-chart of an alternate embodiment of the software protocol used by the reporter terminal 13 is shown. Upon receipt of either a log-in or update request packet, the reporter terminal initiates its response as represented by a begin block 681. At a block 683, the source indicator is compared to the log-in list to determine whether the currently requesting attorney terminal has been logged therein. If not, the source indicator (or source identifier) is added to the log-in list.

At a decision block 685, the reporter terminal compares the received “self.current” pointer with its “rptr.current” pointer. If the difference between the two is less than a threshold value (as indicated by the “yes” path), the reporter terminal: 1) checks for viewing restrictions in the requested range, at a block 691; 2) constructs a response which includes all of the requested Q&A's with substituted text where restricted, at a block 693; 3) transmits the response to the requesting attorney terminal, at the block 695; and 4) exits the routine at the end block 697. However, if the difference at the decision block 685 is greater than the threshold value (indicated by the “no” pathway), prior to encountering blocks 691, 693, 695 and 697, the reporter terminal first prepares and sends a range packet inquiry to the requesting terminal and awaits an update response therefrom, respectively at blocks 687 and 689. The range inquiry packet indicates the total range of outstanding Q&A's. The attorney terminal responds by prompting the user for the earliest desired Q&A, constructs another update request packet, and transmits it to the reporter terminal which awaits its receipt at the block 689. From this responsive update request packet, the reporter terminal completes the protocol through the remaining blocks as described above.

FIG. 16 is a perspective view of an alternate embodiment of the present invention not only providing for review and analysis of synchronized audio, video and textual transcripts, but also providing real-time audio and video on demand via a remote relaying and storage system 955. The CAT system 13 receives audio and video signals from the video recorder 957, processes and transmits them to the remote relaying and storage system 955. Synchronization information and the textual record are also transmitted.

The first function of the system 955 is to provide repeater functionality wherein the incoming signal is relayed to any number of selected communication ports providing access to remote terminals. In this way, the remote terminals, such as remote terminals 951 and 953, may receive virtual real-time video and audio on demand. In addition, the system 955 performs storage and play-back of the video and audio transcripts along with the textual transcript and synchronization information so that the transcript can be selectively parsed and reviewed during and after the proceeding.

To store the video transcript, the system 955 first converts the incoming video signals into any one of a number of established bit-parallel digital interface formats for component video. The CCIR recommendation 601, the SMPTE RP 125, and the SMPTE 240M-1988 standards are exemplary formats which may be adopted. For example, to achieve the CCIR recommendation 601 standard, the CAT system 13 divides the video signal into three components, a luminance signal (Y) and two color difference signals (R-Y, B-Y). The luminance component is sampled into the digital domain at 13.5 MHz and the color difference components are sampled at 6.75 MHz each. One byte of color difference is transmitted (byte serial over eight differential conductor pairs with a ninth pair carrying a clock signal of 27 MHz) followed by a luminance byte which is then followed by the second color difference component. The net result is a 27 Mbytes/second digital data stream that can pass through hundreds of meters of cables and through many stages of processing without degradation.

To capture and play-back the video, the storage system 955 incorporates a series of storage disks arranged in parallel. This might be accomplished by combining several heads in a single large disk or by combining several small disks (often referred to as RAID architecture). However, preferably, the system 955 incorporates real time disks grouped in a modular fashion. For example, two subsystems of six disks combined to transfer at a sustained rate of 26.8 Mbytes/second will support the CCIR 601 standard. Recognition Concepts, Inc., manufactures a DataSTORE® Real Time Disk series for such purposes.

A variety of input devices, including video camera 951 might also be converted to the CCIR standard. Encoders and decoders are available to allow inputs and outputs from PAL, NTSC, RGB, YUV, and CCIR 601, permitting many different formats to be conveniently reformatted into a single standard digital path for temporary or permanent storage onto magnetic disk or tape.

In addition, the storage system 955 may act as a “warehouse” for storage of the audio, video and textual transcripts for post-proceeding review. Individuals may, via remote terminals such as the terminals 951 and 953, access any prior stored data. Because the system 955 also provides storage for the synchronization information as mentioned above, the video and audio transcript can be fully synchronized with the corresponding textual record.

FIG. 17 is a perspective view of an embodiment of an overall transcription system according to the present invention providing billing monitoring, collection and transcription authorization. A stenographic recorder 11 is used by a court reporter at a deposition, hearing or other transcription proceeding to record digital coded signals representative of verbal communications as they are spoken. Using a communication link 19, the stenographic recorder 11 transfers the representative signals to a computer aided transcription (CAT) system 13, a computer terminal used by the court reporter for automated transcription. In real time, the CAT system 13 produces a textual record of the proceeding, which corresponds to the actual words spoken.

The CAT system 13 communicates the textual transcript as it is created in real time along a communication link 20 to a variety of remote or local terminals, such as attorney terminals 15 and 17, for review.

The CAT system 13 may selectively charge a client on any one of (i) a time of use, (ii) a word by word basis or (iii) a page by page basis. Such decision can be made by the court reporter upon initial installation and setup of the CAT system 13, or may be made on a use by use basis.

The CAT system 13 operates in at least two exemplary modes for revenue collection through coordination and monitoring by a billing system 630. Any of the modes of operation are available through configuration of the CAT system 13, and through interaction with the billing system 630.

In a first mode of operation, the CAT system 13 interacts with the billing system 630 to preorder a certain volume of transcription, e.g., authorizing a future amount of hours, future number of words, or future number of pages. Either the billing system 630 automatically charges, for example, a credit card, or may directly bill the court reporter registering therewith. Thereafter, when the amount of remaining prepaid services drops to a low threshold level, the CAT system 13 informs the court reporter and suggests further prepaying interaction with the billing system 630. If the court reporter fails to prepay for further transcription authorization before the remaining amount is used, the CAT system 13 enters an emergency mode of operation wherein the court reporter is regularly warned that a threshold amount of transcription, typically one full days worth, will be allowed with none permitted thereafter. If prepayment is made before the threshold is reached, the court reporter will pay for the amount owed before being permitted to prepay. Once the threshold has been reached, the CAT system 13 will prevent further use of the real time viewing any terminal other than the CAT system 13.

In a second mode of operation, the CAT system 13 operates to require payment for transcription services rendered so long as ceiling amount is not exceeded. The court reporter using the CAT system 13 may interact with the billing system 630 to pay off any amounts previously incurred. If the amount incurred reaches a first threshold, warning messages will be given. If the ceiling amount is reached, the CAT system 13 will lock out real time down line viewing by any other terminal.

Of course, in either the first or second mode, different or additional numbers of stages of warnings and different or additional restrictions on full use of the real time transcription output can be employed to ensure payment.

The billing system 630 can be accessed online, e.g., via the Internet or via a private dial-up network.

Typically, such billing functionality of the CAT system 13 is built entirely within the transcription software running on a computer system that underlies the CAT system 13. However, a dongle or other hardware may be used which itself counts or otherwise tracks and limits usage of the CAT system 13. The dongle can be reset through interaction with the billing system 630, automatically or through a court reporter's manual interaction therewith. A dongle often attaches to a parallel or serial port of a computer, but may also comprise a circuit board for example.

The billing system 630 also delivers a printed invoice automatically based on a court reporter's registration information. Similarly, the CAT system 13 automatically generates invoices not only including the charges passed through to the billing system 630, but also including the court reporters charges to the attorneys or others employing such transcription services. The court reporter need only input the amount that they are charging per hour, per page or per word. The invoice may then be delivered directly along with the transcript itself after editing, or immediately after a transcription preceding has ended.

It is anticipated that the billing system 630 be used to capture and report on overall usage information as well as track the court reporters themselves. To this extent, the billing system 630 can be used by the CAT software manufacturers and/or by court reporting companies or organizations.

The billing system 630 may be, for example, incorporated in the relaying and storage system 955 discussed above with respect to FIG. 16.

FIG. 18 is a block diagram of one embodiment of the network of FIG. 16. FIG. 18 illustrates user access to transcripts via a web server 721 that is communicatively coupled to the CAT System 13. The court reporter uses the stenographic recorder 11 and the CAT system 13 to transcribe spoken words into text, which is then, in-real time, communicated to the web server 721, for subsequent access by a subscriber using Subscriber's computer 723 or by a web user using a Web Client Environment 725. The web server 721 may selectively provide access to textual transcripts to Subscriber's computer 723 or by a web user using a Web Client Environment 725 in real time mode or in non-real time mode. In addition, the web server 721 selectively saves transcripts in its persistence mechanism 735 (such as hard disks, writable CD-ROMs, tape, etc.) for subsequent access by the subscriber or the web user.

Typically, the subscriber establishes an account with a subscription manager 739 using subscription web pages provided by the web server 721. Subsequently, the subscriber is provided access to transcripted web pages saved in the persistence mechanism 735. Subscribers can subscribe to real-time delivery of transcripts, non-real time access (i.e. saved transcripts), or to both.

Certain transcripts are selectively made available, either in real-time mode or in saved mode, to web users using the Web Client Environment 725. Typically, the web user is provided transcripts on a test basis after the web user establishes a temporary subscription or registers with the Subscription Manager 739. The Subscription Manager 739 may incorporate the billing monitoring, collection and transcription authorization functionality discussed above with respect to the billing system 630 of FIG. 17. Web users can also selectively obtain access to specific transcripts, in real-time mode or from those saved in the persistence mechanism 735, after paying for the transcripts using, for example, their credit cards. Such credit card purchases are handled by a transaction system 741 via web pages provided by the web server 721.

In one embodiment of the present invention, the court reporter interacts with administrative web pages provided by the web server 721 to set up a live transcription event for real-time transmission of transcripts from the CAT System 13 to the web server 721. In this embodiment, the court reporter can also selectively identify the recipients of the transcripts, such as the attorneys of both sides of a litigation, deposition, or a court proceeding. The court reporter can also specify if portions of the transcripts can be made available to a wider audience. Such specification of recipients may selectively be modified by the subscription manager 739.

In another embodiment of the present invention, all transcripts created by the court reporter, with the help of the stenographic recorder 11 and the CAT system 13, are saved in the persistence mechanism 735. The court reporter and other subscribers may retrieve all or some portions of the transcripts from the web server 721 under the control of the subscription manager 739. Such retrieval of transcripts may be in the form of downloaded files over the internet, attachments to email, printed pages or mail by traditional post. All such retrieval are monitored and adjustments are made to the court reporters accounts and subscriber accounts as appropriate.

In another embodiment, court reporters and attorneys are also treated as other subscribers when they try to access transcripts from the web server 721, i.e., all subscribers including court reporters and attorneys are charged for the transcripts they retrieve. However, court reporters are allowed to transmit transcripts to the web server 721 from the CAT system 13 using a pre-established account.

It should be clear that the embodiments of the present invention described hereinabove are merely illustrative and that other modifications and adaptations may be made without departing from the scope of the appended claims. 

1. A method for real time transcription of spoken words to a textual format comprising: communicatively coupling a computer aided transcription system to a remote system; identifying, by the remote system, the computer aided transcription system communicatively coupled thereto; communicating, by the remote system, to the computer aided transcription system an authorization for a predetermined amount of transcription; transcribing, by the computer aided transcription system, representations of spoken words to text in real time; disabling, by the computer aided transcription system, the transcribing after the predetermined amount of transcription has been reached; and charging by the remote system a credit card for the predetermined amount of transcription.
 2. The method of claim 1 further comprising indicating by the computer aided transcription system that the predetermined amount of transcription is about to be reached.
 3. The method of claim 1 further comprising charging by the remote system a credit card for the predetermined amount of transcription.
 4. The method of claim 1 further comprising generating by the remote system a bill for the predetermined amount of transcription.
 5. The method of claim 1 further comprising enabling by the computer aided transcription system the transcribing after a payment is made to the remote system.
 6. The method of claim 5 wherein the enabling is responsive to a communication from the remote system.
 7. The method of claim 1 wherein the predetermined amount of transcription is based on one of time transcription, number of words transcribed, and number of pages transcribed.
 8. A method for real time transcription of spoken words to a textual format comprising: transcribing, by a computer aided transcription system, representations of spoken words to text in real time; monitoring by the computer aided transcription system, the transcribing; and disabling, by the computer aided transcription system, the transcribing after a predetermined amount of transcription has been reached; and communicatively coupling the computer aided transcription system to a remote system; identifying by the remote system the computer aided transcription system communicatively coupled thereto; communicating by the remote system to the computer aided transcription system an authorization for a further predetermined amount of transcription; and enabling by the computer aided transcription system the transcribing.
 9. The method of claim 8 further comprising indicating by the computer aided transcription system that the predetermined amount of transcription is about to be reached.
 10. The method of claim 8 wherein the predetermined amount of transcription is based on one of time of transcription, number of words transcribed and number of pages transcribed.
 11. The method of claim 8 wherein the communicating is responsive to payment for the further predetermined amount of transcription.
 12. The method of claim 8 wherein payment is made pursuant to at least one of a credit card payment to and a bill generated by the remote system.
 13. The method of claim 8 further comprising: communicatively coupling the computer aided transcription system to a remote system; and generating by the remote system a bill for the predetermined amount of transcription.
 14. A method for transcription of spoken words to a textual format comprising: communicatively coupling a computer aided transcription system to a remote system; transcribing by the computer aided transcription system representations of spoken words to text; communicating by the computer aided transcription system the text to the remote system; communicatively coupling a terminal to the remote system; communicating by the remote system at least a portion of the text to the terminal; ceasing communications by the remote system to the terminal after a predetermined amount of transcription has been reached; and generating a bill based on the amount of transcription.
 15. The method of claim 14 wherein generating a bill based on the amount of transcription comprises determining one of the amount of time the remote system is communicating text to the terminal, the number of pages of transcript the remote system is communicating to the terminal, and the number of words the remote system is communicating to the terminal.
 16. The method of claim 14 further comprising storing by the remote system in memory the text communicated from the computer aided transcription system.
 17. The method of claim 16 further comprising retrieving the text from memory.
 18. The method of claim 17 wherein the communicating of the text to the terminal is in real time.
 19. A method for real time transcription of spoken words to a textual format, comprising: communicating to a computer aided transcription system an authorization for real time transcription of spoken words; and automatically indicating for a user of the computer aided transcription system that an amount of real time transcription of spoken words has occurred; and wherein communicating to the computer aided transcription system the authorization for an amount of real time transcription, includes: communicatively coupling the computer aided transcription system to a remote system; identifying, by the remote system, the computer aided transcription system communicatively coupled thereto; and communicating, by the remote system, to the computer aided transcription system an authorization for predetermined amount of transcription.
 20. The method of claim 19, wherein communicating to the computer aided transcription system occurs prior to automatically indicating that the amount of real time transcription of spoken words has occurred.
 21. The method of claim 19, further comprising: transcribing, by the computer aided transcription system, representations of spoken words to text in real time; and disabling, by the computer aided transcription system, the transcribing after the predetermined amount of transcription has been reached.
 22. The method of claim 19 further comprising charging by the remote system a credit card for the predetermined amount of transcription.
 23. The method of claim 19 further comprising enabling transcription by the computer aided transcription system after an electronic payment is made.
 24. The method of claim 19 wherein the amount of transcription is based on one of time transcription, number of words transcribed, and number of pages transcribed.
 25. The method of claim 19 wherein automatically indicating for a user of the computer aided transcription system that an amount of real time transcription of spoken words has occurred includes automatically providing information regarding purchase of transcription related services. 